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FUNCTIONS OF BOUNDED VARIATION, THE DERIVATIVE OF THE 
ONE DIMENSIONAL MAXIMAL FUNCTION, AND APPLICATIONS TO 
g ; INEQUALITIES 
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Abstract. We prove that if / : / C R — > R is of bounded variation, then the uncentered 
^\ | maximal function Mf is absolutely continuous, and its derivative satisfies the sharp inequality 

\\DM f\\ L n!) < \Df\(I). This allows us obtain, under less regularity, versions of classical 
i inequalities involving derivatives. 
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1. Introduction. 



The study of the regularity properties of the Hardy-Littlewood maximal function was 
initiated by Juha Kinnunen in |Kij . where it was shown that the centered maximal operator 
is bounded on the Sobolev spaces W 1 ' p (M d ) for 1 < p < oo. This result was used to give a new 
t^j- ; proof of a weak-type inequality for the Sobolev capacity, and to obtain the p-quasicontinuity of 
the maximal function of an element of W 1,p (M. d ), 1 < p < oo. Since then, a good deal of work 
has been done within this line of research. In |KiLij a local version of the original boundedness 



result, valid on W 1,P (Q), Q C M. d open, was given. Generalizations were presented in [HaOn 



extending both the global and local theorems to the spherical maximal function for the range 



d > 1, d/{d— 1) < p. The regularity of the fractional maximal operator was studied in |KiSaj . 
Hannes Luiro proved in |Luj the continuity of the Hardy-Littlewood maximal operator on 
W 1 ' p (M d ) (continuity is not immediate from boundedness because of the lack of linearity), 
finding an explicit representation for the derivative of the maximal function. Among other 
articles dealing with related topics we mention |Buj . |Kolj . and [Ko2j. 



As usual, the case p — 1 is significantly different from the case p > 1, not only because 
is not reflexive (so weak compactness arguments used when 1 < p < oo are not 
available for p — 1), but more specifically to this problem, because Mf ^ L 1 (M d ) whenever 
/ is nontrivial, while the maximal operator acts boundedly on LP for p > 1. Nevertheless, in 
dimension d — 1, Hitoshi Tanaka proved (cf. jTaj ) that if / e H^ 1 ' 1 (1R), then the noncentered 
maximal function Mf is differentiable a.e. and ||DM/||i < 2||D/||i. We shall be concerned 
(mostly) with the case d = 1 and p — 1. What had not previously been noticed, and we show 
here, is that the maximal operator can actually improve the regularity of a function /, rather 
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than simply preserving it, and without increasing the variation. This leads to the possibility 
of obtaining, under less smoothness, versions of classical inequalities involving a function and 
its derivatives. 

Recall that if / 6 W /1 ' 1 (1R), then / is absolutely continuous and of bounded variation. We 
refine Tanaka's arguments, obtaining the best possible bound and generalizing it to the class 
of functions of bounded variation. Let I be an interval, let / : I — > R be of bounded varia- 
tion, and let Df be its distributional derivative. Denoting by Mf the noncentered maximal 
function of /, we prove (cf. Theorem 12. 5j) that Mf is absolutely continuous. Hence, Mf is dif- 
ferentiable a.e. and its pointwise derivative coincides with its distributional derivative DMf; 
thus, the latter is a function and not just a Radon measure. Furthermore, the variation of Mf 
is no larger than that of /, in the sense that HDM/H^im < \Df\(I). This inequality is easily 
seen to be sharp. Also, without some assumption of bounded variation type the result fails: 
There are bounded, compactly supported (hence integrable) functions such that Mf is not 
differentiable on a set of positive measure (see Example 14. 2|) . We mention that for bounded 
intervals /, the fact that \\DMf\\ L i {I) < \Df\(I) tells us that Mf : BV(I) -> W^\I) 
boundedly. 

Finally, we note that from the viewpoint of regularity the noncentered maximal operator 
is better behaved than the centered one: The latter yields a discontinuous function when 
applied, for instance, to the characteristic function of [0, 1]. And the same can be said about 
the one directional maximal operators. In higher dimensions and for p > 1, we mention that 
even though Kinnunen stated his boundedness result from |Kij only for the centered operator, 
it also holds for the uncentered one by a simple modification of his arguments, as noted by 
Tanaka in jTaj . Alternatively, boundedness of the uncentered operator can be deduced from 
Theorem 1 of [ HaUnj . 

This paper is organized as follows. Section 2 contains the basic definitions, the main result, 
and corollaries. Section 3, the lemmas used in the proof. In Section 4 examples are presented 
illustrating the basic issues involved and showing that the main theorem is in some sense 
optimal. As an application, in Section 5 we give a variant of Landau's inequality under less 
regularity. Save for the issue of best constants, the inequality we present is stronger than 
Landau's. Of course, this kind of argument can be applied to other inequalities also. As a 
second, simple instance, we present a trivial variant of the Poincare-Wirtinger inequality. 

2. Definitions, main theorem and corollaries. 

By an interval / we mean a nondegenerate interval, so examples such as [a, a] or (a, b) with 
b < a are excluded. But we do include the cases a = — oo and b = oo, so I may have infinite 
length or even be the whole real line. Let A denote Lebesgue measure and A* Lebesgue outer 
measure. When a and b are distinct real numbers, not necessarily in increasing order, we let 
I (a, b) stand for a (nonempty) interval whose extremes are a and b, while if the interval I is 
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given, we use £(I) and r(J) to denote its left and right endpoints. Since functions of bounded 
variation always have lateral limits, we can go from (a, b) to [a, b] by extension, and viceversa 
by restriction. Thus, in what follows it does not really matter whether I is open, closed or 
neither. Nevertheless, if at some stage of an argument it is useful to assume that I(a, b) is of 
a certain type, we shall explicitly say so. 

Definition 2.1. Given P = {x l5 . . . , Xl} C / with X\ < • • • < Xl, the variation of /:/—>• R 
associated to the partition P is defined as 

L 

V(f,P) :=El/(^)-/(^-i)l, 
i=2 

and the variation of / on /, as 

V(f):=supV(f,P), 
p 

where the supremum is taken over all partitions P of /. We say that / is of bounded variation 
if V(f) < oo. 

We use Df to denote the distributional derivative of /, and J, J to denote intervals. Of 
course, if / : I — ► R is absolutely continuous then Df is a function, which coincides with the 
pointwise derivative f'oif. In this case we also denote the latter by Df. 

Definition 2.2. The canonical representative of / is the function 

f(x) := limsup / f(y)dy. 

\(I)->o,xei ^K 1 ) J i 

The Lebesgue differentiation theorem tells us that f = f a.e., so / does represent the 
equivalence class of /. Of course, taking the lim inf would yield a representative as "canonical" 
as the one above; we just selected the one best suited to our purposes. 

It is well known (cf. Lemma f3. 71 in the next section) that if / is of bounded variation, then 
Df is a Radon measure with \Df\(I) = V(f) < oo, where \Df\ denotes the total variation 
of Df. 

Definition 2.3. Given a locally integrable function / : I — ► R, the noncentered Hardy- 
Littlewood maximal function Mf is defined by 

Mf(x) := sup -L f \f(y)\dy. 

If R > 0, the definition of the local maximal function M^f is the same as above, save that 
the intervals J are also required to satisfy XJ < R. 

Remark 2.4. The terms restricted and truncatedh&ve been used in the literature to designate 
Mjif. However, in both cases the meaning differs from the usual notions of restriction and 
truncation of a function, so we prefer local. M R f is genuinely local in that its value at x 
depends only on how / behaves in an ^-neighborhood of x. 
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For some purposes the relevant maximal operator is Mr rather than M (cf., for instance, 
[HaJ). Thus, it seems worthwhile to point out that the results we prove on regularity and 
size of the derivative hold both for M and Mr, essentially for the same reasons. So in the 
statements of the theorems M and Mr will appear, but the proofs will only mention M, 
unless some modification is needed to cover the case of Mr. With respect to the possibility 
of deriving the results for Mr from those of M, or viceversa, it is not clear to us whether 
this can be done, in view of the fact that neither DMf nor DMrJ pointwise dominates the 
other. 

The local maximal function will be used at the end of this paper to prove an inequality of 
Poincare-Wirtinger type. There Mr cannot be replaced by M. 

Theorem 2.5. // / : / — > IR is of bounded variation then Mf is absolutely continuous. 
Furthermore, V(Mf) < V(f), or equivalently, \\DMf\\ L i^ < \Df\(I). The same holds for 
M R f. 

Proof. We assume that < / = /. Since / is upper semi continuous (Lemma 13 .3j) . and 
/ — Mf, the maximal function is continuous ( Lemma 13.41 and Remark l3.5j) . and of bounded 
variation with V(Mf) < V(f) (Lemma 13. 9|) . Also, the image under Mf of a measure 
zero set has measure zero fLemma I3.1(J|) . so by the Banach Zarecki Theorem ( Lemma 13. 2|) 
Mf is absolutely continuous, whence \DMf\(I) = \\DMf\\Lim. Finally, by Lemma 13.71 
\DMf\(I) = V(Mf) and \Df\(I) = V(f), so \\DMf\\ L , {I) < \Df\{I). □ 

Remark 2.6. Actually, the proof of Lemma l3~Ul yields a slightly stronger result: Given / and 
Mf on /, and any subinterval J G I such that the endpoints of J belong to {Mf = /}, we 
have V(Mf\j) < V(f\j). That is, the variation fails to grow not only when considered over 
the whole interval, but also over a wide class of subintervals. The key to this reduction of the 
variation is the rather simple behaviour of Mf on the components of the open set {Mf > /}: 
If (a,/3) is any such component, then either Mf is monotone there, or there exists a c in 
(a, (3) such that Mf is decreasing on (a, c) and increasing on (c,/3). 

We recall the definitions of the space BV(I) of integrable functions of bounded variation 
and of the Sobolev space W 1)l (I). 

Definition 2.7. Given the interval /, 

BV{I) :={/:/- R\f G L l {I) and \Df\{I) < oo}, 

and 

W 1 ' 1 ^) :={/:/-> R\f G L\I), Df is a function, and Df e L\l)}. 

It is obvious that W 1)l (I) C BV(F) and that the inclusion is proper. The Banach space 
BV(I) is endowed with the norm := \\f\\^(i) + \Df\(I), and W^\I), with the 

restriction of the BV norm, i.e., ||/|| w i,i(j) := Wfh^i) + \\Df\\ L i {I) . 

Remark 2.8. It is well known that for every / G L 1 ^) and every c > 0, c\({Mf > 
c}) < 2||/||li(/), so by Theorem 12 .5[ the maximal operator satisfies a "mixed type" inequality 
on BV(I), weak type for functions and strong type for their derivatives. Hence M maps 
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BV(I) into the subspace of L 1,oc consisting of the functions whose distributional derivative 
is an integrable function. But if I is bounded, something stronger can be said: Mf maps 
boundedly BV(I) into W^{I). 

Corollary 2.9. Let I be bounded. Then there exists a constant c = c(I) such that for every 
f G BV{I), Mf G W X >\I) and ||M/|| w i,i (J) < c||/|| w(/) . 

Proof. By Sobolev embedding for BV functions (cf. Corollary 3.49 p. 152 of |AFPj ). ||/||oo < 
c{I)\\f\\ B v(i), so 

IWIk(7)<A(/)||/||oc<A(^c(J)||/|| ww . 

□ 

After this article was completed, the authors were able to show (cf. |AlPej . Theorem 2.7) 
that the local maximal operator Mr is bounded from BV(I) into W l,l {I) even if I has infinite 
length; in addition, the bounds grow with R as 0(}ogR). 

3. Lemmas. 

Given / : / — > R, define the upper derivative of / as 

D / (x)=limsup /(X + ^' /(x) . 

The next lemma is well known and we include it here for the reader's convenience. 

Lemma 3.1. Let f : / — > K be a continuous function, and let E C [16/: < fc}. 

Then X* (/ (E)) < k\*E. 

Proof We may assume that I is open. Fix e > 0, and let O C I be an open set with 
E C O and AO < X*E + e. For each x G E, pick y so that the closed interval I(x, y) is 



contained in O and for every z G I(x,y), 



< k + e. Then V := {I(x,z) : x G 



E,z E I(x,y) C 0} is a covering of with A(UV) < AO < X*E + e. By continuity of 
/, for every pair {x, z} the set f(I(x,z)) is connected, hence an interval or a point. Let 
C := {c G E : for some I(x,z) G V, f(I(x, z)) = c}. Suppose Ci,C2 G C and Ci 7^ C2. 
Then / _1 ({ci}) and / _1 ({c2}) are disjoint sets and each contains an interval, from which 
it follows that C is at most countable. Now if z — > x then f(z) — > f(x), so the collection 
/(V) := {f(I(x, z)) : I(x,z) G V and / is not constant on I(x, z)} is a Vitali covering of 
f(E)\C. Hence there is a disjoint subcollection {/„} of /(V) such that A* (f(E) \ U n / n )) = 0. 
For each n, select I(x n ,z n ) such that f(I(x n ,y n )) = I n . Then 

A* (f(E)) < HQ < J2( k + e )Kl^n, y n )) <(k + e)XO <{k + e)(X*E + e). 

n n 

□ 
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The following lemma is the direction we need of the Banach Zarecki Theorem (an "if and 
only if result). It is an immediate consequence of the Fundamental Theorem of Calculus 
for the Lebesgue integral, the a.e. differentiability of functions of bounded variation, and the 
preceding lemma. 

Lemma 3.2. Let f : / — > R be a continuous function of bounded variation. If f maps 
measure zero sets into measure zero sets, then it is absolutely continuous. 

Let / : I — > R be a function of bounded variation. Then |/| is also of bounded variation, and 
(I) < \Df\(I). Additionally, when studying the boundedness properties of the maximal 
function it makes no difference whether we consider / or |/|. For notational simplicity we 
will often assume that / > 0. 

The next two lemmas are likely to be well known, and small variants certainly are. But 
since we are not aware of any explicit written reference, we include them for completeness. 

Lemma 3.3. The canonical representative f of a function of bounded variation f : I — *■ R 
is upper semicontinuous and of bounded variation. Furthermore, f minimizes the variation 
within the equivalence class of f . 

Proof. Recall that a function / of bounded variation (being the difference of two monotone 
functions) has left and right limits at every point. To see that / is upper semicontinuous, 
simply note that f(x) = max-flim^ f(y), lim^. f(y)}, so for every x G I and every sequence 
{x n } in / converging to x, limsup n f(x n ) < f(x). And V(f) < V(f) follows immediately 
from the fact that f(x) satisfies the following two conditions: i) If x is a point of continuity 
of /, then f{x) = f(x); ii) if x is a point of discontinuity of /, then f(x) belongs to the closed 
interval determined by the extremes \im y ^ x f(y) and \im y i x f(y). Finally, it is quite obvious 
(or else, cf. Theorem 3.28, page 136 of |AFPj ) that on the equivalence class of /, V achieves 
its minimum value at g iff g satisfies both conditions i) and ii) above. □ 

Next, we consider balls associated to some norm in IR d and the corresponding maximal 
operator. In the local case R will denote the diameter, in accordance with our one dimensional 
convention. While in this paper we only need the case d — 1, we state the next result for 
arbitrary d, as it is likely to be useful in future work. 

Lemma 3.4. Let f : W d — > [0, oo] be locally integrable. If f is upper semicontinuous at 
w G M d and f(w) < Mf(w), then Mf is continuous at w. The same holds for M R f . 

Proof. Since Mf is lower semicontinuous, it suffices to prove the upper semicontinuity of 
Mf at w. The idea is simply to note that if "large" balls are considered near w, increasing 
their radii a little so as to include w cannot decrease the average by much, while if one is 
forced to consider arbitrarily small balls, then the fact that limsup n f(x n ) < f(w) whenever 
x n — > w leads to the same conclusion for Mf. More precisely, we show that given e > 0, 
there exists a 5 > such that for all x G B(w,8), Mf(x) < Mf{w) + e. Fix k » 1 such 
that (1 + l/k) d Mf(w) < Mf(w) + e, and choose r > with f(y) < f(w) + e whenever 
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\y — w\ < 3t. Set 8 = r/k, let \x — w\ < 8, and let B(u,r) be a ball containing x. If 
B(u, r) C {/ < f(w) + e}, then 



1 



f(y)dy<f(w) + e<Mf{w)+e, 



\(B(u,r)) J BM 

while if -B(it, r) n {/ < f(w) + e} c ^ 0, then r > r, and since 10 G -B(w, r + 5), we have 



A(5(«, r)) J BM X(B(u, r)) A(£(u, r + 5)) J B(u>r+S) 



<(l + ~J Mf(w) < Mf(w)+e. 

In the case of M R f, if the large balls already have diameter i?, instead of increasing the radii 
just translate the balls slightly. The easy details are omitted. □ 

Remark 3.5. Actually, we shall apply the preceding lemma to arbitrary intervals / C R 
and not just to R. Nevertheless, the result is stated for R d since when d > 1, there are 
connected and simply connected open sets O C lR d for which it fails, as we shall see. The 
difference stems from the fact that when working on O, the maximal function is defined by 
taking the supremum over balls contained in O. If d = 1, the proof of the lemma can be 
easily adapted to intervals. Alternatively, the result for R implies the general case as follows: 
Given Jci, extend / > to R by setting it equal to zero off /. This changes neither the 
upper semicontinuity of / at points in /, nor the values of Mf on /. Regarding the case 
d > 1, counterexamples already exist when d — 2. For convenience we use the ioo norm 
y)\\oo — max{|a;|, \y\} on R 2 , so balls will refer to the metric defined by || ■ H^. Let 
O C R 2 be the open set (0,2) 2 U (0,3) x (0, 1). Define / := X(o,i] 2 on °- Then / is u PP er 
semicontinuous, Mf > 1/4 on (0,2) 2 , Mf = on (2,3) x (0, 1) (so Mf is not continuous), 
and Mf > f everywhere. 

Lemma 3.6. Let f : I — ► [0, oo) be an upper semicontinuous function such that for every x G 
I, f\ x ) < Mf(x). Suppose there exists an interval [a,b] C I with max{M/(x) : x G [a,b]} > 
max{ Mf(a), Mf(b)}. If c G [a, b] is a point where Mf achieves its maximum value on [a, b], 
then Mf(c) = /(c) = max^g^] f(x). If either f G L 1 ^) or we consider Mr f instead of Mf , 
then the same result holds under the following weaker assumption: Mf ( respectively Mr/ ) 
achieves its maximum value on [a, b] at some interior point, so max{M/(x) : x G (a, b)} > 
max{M/(a),M/(6)} (respectively m&x{M R f(x) : x G (a, b)} > max{M R f (a), M R f(b)}). 

Proof. By Lemma E31 Mf is continuous. Suppose that Mf(c) = max{M/(x) : x G [a,b]} > 
max{M/(a), Mf{b)}. If /(c) < M/(c), by upper semicontinuity of / there exists an open 
interval J := (c - 8, c + 5) such that if x G J, then f(x) < 2 _1 (/(c) + M/(c)). Define 
L(t) = \ f^_ t f on [8, c — a] and R(x) = - J^ +x f on [8, b — c]. By continuity, there exist to 
and Xq maximizing L and R respectively. Since M/(c) > max{ M/(a), Mf(b)}, in order to 
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evaluate Mf(c) we only need to consider intervals properly contained in [a, b], so Mf(c) 
max{L(t ), R(xq)}. Suppose without loss of generality that L(t ) < R(x ). Then 



i rc+xo i / pc+8 



Mf(c) = - f < - / 2- 1 (/(c) + M/(c)) + / Mf(c) < Mf(c), 

x Jc x \Jc Jc+S J 

a contradiction. 

Suppose next that Mxf achieves its maximum value on [a, b] at some interior point, so 
max{M T /(x) : x G (a, b)} > max{M r /(a), M T f(b)}. Define L on [0, min{T, c - £(I)}} and 
R on [0,min{T, c + as above, and conclude that there exist to an d Xq maximizing L 

and R respectively. Then argue as before. If / G L l (I) we reason in the same way. Suppose 
for instance that I = R. Then lim^oo L(i) = lirn^oo = 0, so again there are points to 
and Xo maximizing L and i?. The case of bounded or semi-infinite intervals is easily handled, 
assuming, for instance, that finite extremes belong to J, and concluding as before that to and 
xq exist. □ 

Lemma 3.7. Let /:/—>• [0, oo) fre o/ bounded variation. Then V(f) = \Df\(I). 

Proof. This is well known, and it follows from |AFPj . Proposition 3.6 p. 120 together with 
Theorem 3.27 p. 135 (making the obvious adjustments in the definitions, to take into account 
that we do not assume / G L l (I)). □ 

In what follows we will distinguish between the points where one needs to consider ar- 
bitrarily small intervals to obtain the value of the maximal function, and those where the 
supremum is achieved by looking at intervals of length bounded below. Since at this stage 
we are not assuming that / is integrable, it might happen that for some increasing sequence 
of intervals /„ containing x, and for instance, with I n ] I = [a, oo), Mf(x) > J f f 

but Mf(x) = sup n j^-j fj f. As a shorthand to describe this situation, we write Mf(x) = 

-^jy jj f, and similarly for other intervals of infinite length. Now set 

E := |x G R : there exists an interval / containing x such that Mf(x) = ~^Jj J / 1 • 

In the remaining of this section we assume once and for all that f = f whenever / is of 
bounded variation. Then the Lebesgue theorem on differentiation of integrals together with 
< / = /, entail that E c C {Mf = /}. Next, write 

E n := |x G R : there exists an I such that x & I, Mf(x) = JTj^ J /> an< ^ -M-0 — — j" • 

If we are dealing with Mr/, the sets E and E n are defined as before save for the fact that 
we add the extra condition A(7) < R (so for instance, if R — 1/2, E\ — 0). As usual, Lip(g) 
denotes the Lipschitz constant of a Lipschitz function g. 

Lemma 3.8. Let f G L°°(J). Then the restriction of Mf to E n is Lipschitz, with Lip(M/) < 
n||/||oo- The same holds for M R f . 
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Proof. Fix x,y G E n . By symmetry, we may assume that Mf(x) > Mf(y). By hypothesis, 
there exists a J containing x such that A(J) > ^ and Mf(x) = -^jy fjf- Our notational 

convention allows J to have infinite length, so we suppose first that A (J) < oo. Then 

Mf(x)-Mf(y) < ijjjfjf- xjj^-yj jf _ xtfjljf < ll/lloo ^ 
|x — y| ~~ | a; — y\ X(J) + \x — y\ ~ A(J) + \x — y\ 

The case A (J) = oo is obtained by an easy approximation argument. 

With respect to M R , the reasoning is similar. Suppose that x, y G E n , with x < y. If 
y — x > 1/n, then 

|M fl /(x)-M fl /(y)| < H/lloo -0 ^ 



|x — ?/| y — x 

So assume that y — x < 1/n and M R f(x) > M R f(y) (the case M R f(x) < M R f(y) is handled 
in the same way). By hypothesis there exists an interval [a, b] containing x such that 1/n < 
b — a < R and 

r-6 



M R f(x) = J— f f. 
b~aj a 



Now if ?/ — a < R we can repeat the argument given for Mf, so suppose y — a > R. Writing 
c := R — b + a, we have 



M R f(x)-M R f(y) < bhLf-R-Jy-nf < ( b ~ a ) L f ~ ( b ~ a ) J y - R f + ? L f 
y — x _ y — x ~ (|/ — x)(b — a)R 



< (tz-^-^H/IU + cII/IU = /; 
(y — x)R y — x i? 



□ 



Lemma 3.9. Le^ / : / — ■> [0, oo) fre an upper semicontinuous function such that for every 
x G /, /(x) < Mf(x). Then V(Mf) < V(f). In particular, Mf is of bounded variation 
whenever f is. The same results hold for M R f . 

Proof. We show that Mf varies no more than / on {Mf > /}, and of course the same 
happens on {Mf > f} c = {Mf = /}. Note that {Mf > /} is open in /, since Mf - f is 
lower semicontinuous, being the difference between a continuous and an upper semicontinuous 
function. Let (a, (3) be any component of {Mf > /} (here we can either assume directly 
that I is open, or else consider as possible components intervals of the form [a, (3) and (a, /3], 
which can be handled in the same way as we do below). We will see next that on (a, (3), Mf 
can only behave in one of two ways: Either Mf is monotone, or there exists a c G (a, 0) 
such that Mf decreases on (a, c) and increases on (c,/3). Suppose towards a contradiction 
that for some points ci, 02,03 with a < c\ < c% < c 3 < (3 we have M/(ci) < Mf '(02) 
and Mf(cs) < Mf (02). By changing C2 if needed, and relabeling, we may assume that 
M/(c 2 ) = max{M/(x) : x G [c x ,c 3 ]}. Then c 2 G {Mf > /}, and M/(c 2 ) = /(c 2 ) by Lemma 
I3.6I The result for M R f also follows from Lemma f3.6l 
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To show that V(Mf) < V(f), given an arbitrary partition {x±, . . . ,xl} of / we produce 

a refinement {y u . . .,y K } such that \ M f{Vi) ~ M f{Vi-x)\ < \f(Vi) - f(Vi-i)\- We 
always assume that partitions are labeled in increasing order. Of course, if {xi, . . . , £l} C 
{Mf = /} there is nothing to do. Otherwise, for each 2j £ {Mf > /} there is a unique 
component (a iy /%) which contains it. Add both endpoints ctj and $ to the partition. If Mf is 
monotone on (c^, (3j) , we do not add any new points inside this interval. If Mf is not monotone 
on (ai,(3i), then there exists a c$ £ such that Mf decreases on (c^q), increases on 

(ci,Pi), and Mfici) < min{M/(aij), M/(/3j)}. If q does not already belong to the original 
partition, include it in the refinement. The resulting finite collection P := {yi, . . . ,yx} has 
the desired property, as we shall see. First we study what happens in each For every 

pair yi < y i+ k of points of P that are endpoints of some component (a^,^) and contain one 
or more elements of P between them, say yi < yi + \ < ■ ■ ■ < yi+k, either Mf is monotone on 
that component, and then 

i+k 

\Mf( Vj ) - Mf( yj ^)\ = \Mf(y i+k ) - Mf{ Vi )\ 

j=i+i 

i+k 

= \f(y i+k )-f(yi)\< E \f(yj)-f(yj-i)\, 

j=i+i 

or Mf achieves its minimum value on [yi,y i+k \ at some intermediate y im and Mf(y im ) < 
mm{Mf(yi),Mf(y i+k )}. In this case 

i+k 

E W^Vi) - M /(%-i)l = \ M f(y*+k) - Mf(y im )\ + \Mf(y im ) - Mf( Vl )\ 
j=i+i 

i+k 

< \f(y i+k ) - f( yi J\ + \f(y im ) - f{yi)\ < E " 

Finally, for each pair {j/j, yj+i} not already taken into account, we have \Mf(yi+x)—Mf(yi)\ = 
\f(yi+i) — f(yi)\, so the conclusion follows. □ 

Before proving the next lemma, we mention that on large parts of its domain Mf is locally 
Lipschitz. Of course, matters would be considerably simpler if Mf were locally Lipschitz at 
every point, but unfortunately this need not be the the following example shows: 

Take f(x) = (1 — \/x)x\o,i] ( x ) an d note that Mf fails to be locally Lipschitz at 0. 

Lemma 3.10. Let f : / — > K. be a function of bounded variation, and let N be a set of 
measure zero. Then A (Mf (N)) = 0. The same results hold for Mr/ . 

Proof. Suppose iV has measure zero, and let E n and E be the sets whose definition appears 
just before Lemma 13.81 Since E n f E, by Lemmas 13.11 and 13. 8[ for each n = 1,2,..., 
X(Mf(N n E n )) = 0, so X(Mf(N n E)) = 0. Thus, we may assume that N C E c , whence 
iV C {Mf = /}. We are going to make further reductions on N. First, we remove from it all 
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the intervals I a where Mf is constant; we can do that since XMf(U a I a ) = by Lemma f3.il 
(note that the difference between considering closed or open intervals is at most a countable 
set of endpoints, so the exact nature of the I a 's is of no consequence here). And second, 
we eliminate from N \ U a I a a countable set in such a way that every remaining point is a 
point of accumulation. This can be done by a well known argument: Pick a countable base 
B of intervals, and let {Ij} be the collection of all intervals in B for which (N \ U a I a ) fl Ij is 
countable. Then (N\U a I a ) \ Ujlj has the desired property. For the usual reason of notational 
simplicity, we use N again to denote this thinner set. 

Now we are ready to suppose, towards a contradiction, that X*Mf(N) > 0. Write 8c : = 
X*Mf(N). We show that for every finite sequence of distinct real numbers {x\, . . . , xl] in J, 
labeled in (strictly) increasing order, there is a refinement {yi, . . . , yx} in / with yi < y 2 < 

■ • • < yx and Yla \ f( x i) ~ f( x i-i)\ + c < \f(Vi) ~ f(Vi-i)\- This contradicts the fact that 
/ is of bounded variation. 

The final partition will be produced in several stages, so at any step in the argument, we 
use P to denote the partition already at hand, and P' the immediate refinement obtained 
in that step. So P will denote different partitions at different stages, and the same happens 
with P'. 

By adding more points if needed, and relabeling in increasing order, we may assume that 
"a large part" of N is contained between the first and the last points of the partition (call 
them A and B respectively). By this we mean that 7c < X*Mf(N fl [A, B]). Again we use N 
to denote the null set N fl (A, B), and {xi, . . . ,xl} to denote the points of the new partition. 

When we say that an interval J is determined by P we mean that its extremes are consec- 
utive points in P. Let I\ be the first (according to the real ordering) of the open intervals 
determined by P := {xi, . . . , xl} such that X*Mf(NC\Ii) > 0, and let x i± < x il+ i be the end- 
points of this interval. For each x G N fl I\ and n 6 N pick y n G N fl l\ such that lim n y n = x 
and \x — y n \ < 2~ 1 d(xi 1+ i, I(x, y n )) (where d stands for distance, and I(x,y n ) for the compact 
interval with extremes x and y n ). In the case of M^f we additionally require that for every 
x and every y n in its associated sequence {?/ n }^ , 2\x — y n \ < R. Since Mf is continuous 
and nonconstant in all of those intervals, Mf(I(x,y n )) is a nondegenerate compact interval 
and lim n X(Mf (I (x,y n ))) = 0. Thus the collection V := {Mf(I(x,y n )) : x G N,n e N} is 
a Vitali covering of Mf(N fl Ii). Furthermore, if J is the closed interval with the same left 
endpoint as I(x,y n ) and twice its length, then J C I\. Select a finite, disjoint subcollection 
{Si, . . . , Sr} from V, such that 

(3.1) A* (Mf(N n h) \ (Si U • • • U S R )) < 7- 1 X*Mf(N n h). 

For each i = 1,. . .,R pick J; G {I(x, y n ) : x G N D I x , n G N} with Mf(Ji) = 

Without loss of generality, suppose that /(x^) < f(x il+ i). Adding a finite number of 

points between x^ and to the original partition P does not increase the variation if / 

is behaving monotonically there. Thus, our strategy consists in selecting new points so that 

"broken line configurations" are obtained sufficiently often. 
We consider two cases. In the first, the increase in the variation is obtained by adding to 

the original partition the endpoints of suitable intervals, and either one or two points inside 
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each such interval. In the second, we add the endpoints of intervals not considered in case 1, 
together with either one point outside each such interval (but close to it), or no additional 
point. 

Note that / and Mf take the same values on the endpoints of the intervals Jj, since 
£(Ji),r(Ji) G iV for every i = l,...,R. 

Case 1. Call an interval J of type A if |M/(r(J;)) - Mf(£(Ji))\ > 2~ 1 \S i and there exists 
a a G Ji with f{ Ci ) < min{/(£(^)),/(r(J,))} - 2~ 1 |/(r(J i )) - f(£(Ji))\. Note that if we 
have any partition P with consecutive points a < b and Jj C (a, b), then adding £(Ji), q and 
r(Ji) to P leads to V(f, P) + \\Mf{J t ) < V(f, P') (where P' is the refinement so obtained), 
regardless of the values of f(a) and fib). 

We say that J { is of type B if \Mf(r(Ji)) - Mf(£(Ji))\ < 2~ 1 \S i . Suppose in this case 
that Mf\j. achieves its extreme values at m\,m 2 G Jj, with rrii < m 2 - We add the distinct 
elements in {£( Jj), m 1 , m 2 , r( Jj)j to P, obtaining P' (it may happen that either £(Ji) = mi 
or m 2 = T(Jj), but not both, so P' contains either 3 or 4 points more than P). Note that 
XSi = \Mf(m 2 ) — Mf(m\)\ < \f(m 2 ) — /(mi)|, since where the maximum of Mf occurs, 
Mf and / take the same value (by Lemma [3 .6! if the corresponding is an interior point, 
and by N C {Mf = /} otherwise) while / < Mf always. As before, for some pair of 
consecutive points a < b in P we have Jj C (a, b). It is again clear that no matter what the 
positions of f(£(Ji)), /(mi), f(m 2 ), and f{r(Ji)) are relative to f(a) and /(&), we always have 
V(f,P) + ±\Mf(Ji)<V(f,P'). 

Suppose now that the collection of intervals J^ of type either A or B satisfies J2j — 

3 _1 Xlili Adding to the initial partition all their endpoints and the interior points 

corresponding to each case we get 

1 R 
i 

Case 2. If the case previously considered does not hold, then the set of intervals Jj such 
that \ f{r{Ji 3 )) - f{£{J %] ))\ > 2~ 1 \S ij and for all z G J ij5 f(z) > min{/(^(J i ,)), /(r( J- 3 ))} - 
2- 1 \f(r(J i .))-f(£(J ij ))\, satisfies £\ ^ > |Ef=i^. Call J,, order preserving if /(£( J,. )) < 
/(r(Jj j )) and order reversing if f(£(Ji 3 )) > f(j(J i:j )). We consider two subcases. In the 
first, the subcollection of order reversing intervals, which we rename as {A%, . . . , Aq} is 

large: Yj%\ ^ M f( A j) > | Ef=i We add the points {£(A X ), r(Ai) . . . , £(Aq), r(A Q )} 
to the partition P and note that with this refinement the variation increases by more than 
| ASj. In the second subcase, the subcollection of order preserving intervals, which again 
we denote by {Ai, . . . , Aq}, satisfies Y^=i ^Mf(Aj) > | J2f=i Since for every w G Ai, 
fW > f{£{A x )) - (/(r(A0) - /(*(A0))/2, there exists a Cl G [r(A0, 2r(A x ) - £(^0] such 
that /(ci) < (/(r(^4i)) + f(£(A 1 )))/2. Otherwise, we would have that 

i r2r{Ai)-t(A 1 ) 

2{r{A 1 ) - £{A X )) J i{M) 
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contradicting the assumption that iV C E°. Recall that 2r(Ai) — £(Ai) < x^+i, so c\ G l\. 
Add £(Ai), r(Ai), and c\ to P, together with the endpoints of all intervals {A 2 , . . . , A n m} C 
{Ai, . . . , Aq} contained in [r(Ai), ci] (if there is any). Then 

1 n(l) 

(3.2) V(f, P) + -AM/(A0 + ^ AM/(A S ) < P')- 

2 

Go to the next A q not already considered and repeat the process, relabeling the points in 
increasing order if needed. It may happen that £(A q ) < C\ < r(A q ), so £(A q ) is added to the 
left of the point c\, already in the partition. But this does not harm any estimate, since the 
intervals Mf{A\) and Mf(A q ) are disjoint: If Mf(A q ) lies below Mf(Ai), by considering 
the points £(Ai),r(Ai) and £{A q ) it is easily seen that the summand |AM/(Aj) in ()3.2|) can 
be replaced by XMf(Ai), while if Mf(A q ) lies above Mf(A\), then it is more advantageous, 
from the viewpoint of guaranteeing the increase in the variation, to have £(A q ) < c\ instead 
of c\ < £{A q ). After a finite number of steps the list {Ai, . . . , Aq} is exhausted, and we get 

R Q 

So regardless of whether we are in case 1 or case 2, by ()3.1|) we always obtain a new partition 
P' of with 

V(f, P) + l -\*Mf{N n ii) < V(/, P'). 

Since all the points in P' \ P have been chosen within Ii, we can repeat the argument with 
every other interval J determined by the first partition {x\, . . . , xl}, for which \*Mf(NC\J) > 
0. In this way, a refinement {yi, . . . , yx} of {x±, . . . , xl} is produced, such that 

L K 
2 2 

□ 

4. EXAMPLES. 

This section presents several examples in order to illustrate some of the issues involved and 
why different assumptions in the preceding results are needed. 

Example 4.1. There exists an upper semicontinuous function f (with unbounded variation) 
such that Mf is not continuous. 

Proof: Let / be the characteristic function of the closed set {0} U |J^ =0 [3/2 n+2 , 1/2™]. Then 
Mf(0) < 1/2 while limsup a .^ Mf(x) = 1, so Mf is discontinuous at 0. □ 

This example shows that the hypothesis f(w) < Mf(w) in Lemma 13.41 is necessary. We 
also mention that the canonical representative of / is not upper semicontinuous even though 
/is. 
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In the next example we will follow the convention of identifying a set with its characteristic 
function, thereby using the same symbol to denote both. 

Example 4.2. There exists a bounded, upper semicontinuous function f with compact support 
(and unbounded variation) such that Mf is not differentiable on a set of positive measure. 
In particular, Mf is not of bounded variation. 

Proof: We shall show that there exists a fat Cantor set C such that MC is not differen- 
tiable at any point of C D {MC =1}- By a Cantor set C we mean an extremely disconnected 
compact set such that all its points are points of accumulation. Since C is closed, its charac- 
teristic function C is upper semicontinuous, and obviously of unbounded variation. Note that 
MC(x) = 1 for almost every x G C, so if MC is differentiable at any such x, we must have 
DMC(x) = 0. If fact, C will be chosen so that on C fl {MC = 1} the difference quotients 
diverge in modulus to infinity. 

By fat we mean of positive measure. We shall construct C C [0,1] so that AC > 2/3. 
The main difference with the usual Cantor set is that instead of removing the "central part" 
of every interval at each stage, we remove several parts. Let F = [0, 1] and let F n be 
the finite union of closed subintervals of [0, 1] obtained at step n of the construction, to be 
described below. As usual C := fl n F n . Obviously MC < MF n ; the function MF n is the 
one we will actually estimate. At stage n we remove the proportion 2~ 2n of mass from the 
preceding set, i.e., \F n = (1 - 2~ 2n )AF n „i, so AFi = 3/4, XF 2 = 45/64, et cetera. Then 
AC = lim n AF n = 1 - lim n Ai^ > 1 - f^ =1 2~ 2n = 2/3. Next we ensure that for each n 
"mass" and "gaps" are sufficiently mixed. Let I n -i be a component of F n _i. Subdivide 
J n _i using the 2 2n + 1 equally spaced points ^(J n _i) = xi,£2, • • • ,^2 2 ™+i = r (In-i), and then 
remove the 2 2n + 1 open intervals 0(n,Xi) of length 2 _4n A/ n _i, centered at each Xj, noting 
that the first and last intervals deleted lead only to a decrease in mass of 2 _4n_1 A/ n _i each. 
Do the same with the other components of F n _i to obtain F n . Then all subintervals left 
have the same length. Clearly, the largest average at the points Xi is obtained by considering 
intervals as large as possible but without intersecting any other deleted interval O, so 

Fix z G C R {MC = 1}. For each n, let I n>z be the component of F n that contains z, 
and let w n be midpoint of the nearest interval 0(n) deleted at step n (if there are two such 
midpoints, pick any). Then \z — w n \ < 2~ n ( n+1 \ since the number of components of the set 



is Iir=i 22i = 2"(" +1 ). Therefore 
MC(z) - MC(w) 



lim sup 



z — w 



1-MCK) 2- 2 "- 1 
> hmsup — : : — > lim 7 — r- = oo. 

\z-wJ ~ n^oo 2-™(™+ 1 ) 



n^oo 



□ 



Remark 4.3. At present it is not clear to us how Mf behaves in higher dimensions. This 
was asked in [HaOnJ for W 1,1 (M. d ) (Question 1, p. 169). Note that when d > 1, a function / 
of bounded variation need no longer be bounded, it may not have an upper semicontinuous 
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representative, and Mf may fail to be continuous even if / is bounded, of bounded variation, 
and upper semicontinuous, as the next example shows. Furthermore the equivalence between 
the pointwise variation V(g) of a function g and the size \Dg\(R d ) of its distributional de- 
rivative no longer holds; in fact V(g) is essentially a one dimensional object, and there is no 
corresponding notion for d > 1. All of this means that even if the results in dimension one 
continue to hold when d > 1, no straightforward extension of the arguments presented here 
is possible. 

Example 4.4. There exists a bounded upper semicontinuous function f G BV(M?) such that 
Mf is not continuous. 

Proof: Let / be the characteristic function of the closed triangle with vertices at (0, 0), (1, 2), 
and (2, 1). Then it is easy to check that the noncentered maximal function associated to any 
£ p ball has a point of discontinuity at the origin. The same happens if we consider the 
(noncentered) strong maximal operator (where averages are taken over rectangles with sides 
parallel to the coordinate axes). □ 

Remark 4.5. Standard applications of the maximal function in the context of LP spaces, for 
p > 1, use the fact that Mf dominates |/| pointwise, and hence in norm, but the norm of 
Mf is not much larger than that of /. While the latter fact is still true in Sobolev spaces 
by Kinnunen's theorem, Mf may fail to control / in norm, as the next example shows. This 
points out to the fact that applications of Mf in the theory of Sobolev spaces will tend to 
differ from the usual ones in LP . One such application, explored below, consists in trying to 
replace Df by DMf in inequalities involving a function and its derivatives. Here having a 
smaller norm may in fact be advantageous. 

Example 4.6. For 1 < p < oo there exists an f G H /1 ' p ((0, 1)) such that \\Mf\\w^p((o,i)) < 
ll/llwMCo,!))- 

Proof: Let N» 1, set f(x) = 1 for x G (0, 2" 1 - N' 1 ) U (2" 1 + N~\ 1), f{2~ 1 ) = 0, and 
extend / piecewise linearly to a continuous function on (0, 1). Then / works as advertised, 
since Mf is "close to being constant" (more precisely, Mf is constant save on the middle 
interval of length 2iV~ 1 , where it is Lipschitz: Lip(M/) < 2 - 1 ~n- 1 by Lemma IH.Hj) and 
\\Mf — f\\ p is close to zero for 1 < p < oo (making N depend onp), while \\Mf— fW^ < 1. □ 

The preceding example can easily be modified to obtain the same result in W /1 ' P (IR) for 
p > 1. Also, by fixing p and letting N go to infinity, one obtains a sequence {/at} with 
H-^/jvIIw 1 '* < c < oo and H/zvHw 1 .* T 00 • So there is no uniform domination of fx by Mf N , 
even up to a constant. 

5. Applications. 

While the maximal function is a tool of every day use within the real variable methods in 
harmonic analysis, its importance in the theory of differential equations and Sobolev spaces 
has been considerably smaller. This may start to change as the regularity properties of the 
maximal function are being uncovered. Here we use our main result to prove inequalities 
involving derivatives under less regularity, a novel kind of application. It is convenient in 
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the context of Landau's inequality to adopt the convention oo • = oo, (otherwise if u is 
unbounded and u' is constant, the right hand side of the inequality below is undefined). For 
the real line, the sharp Landau inequality states that given an absolutely continuous function 



u' 



\u'f 00 <2\\u\\ 00 \\u"\ 



Nowadays Landau's inequality (later generalized by Kolmogorov by considering higher 
order derivatives) can be regarded, save for the issue of best constants, as a special case of 
the Gagliardo-Nirenberg inequalities. Pointwise estimates in Landau's (and Kolmogorov's) 
inequality involving the maximal function or some variant of it are known, cf. |Kaj . |Ma Shl , 



and [MaSh2j. Here we present a norm inequality, involving the derivative of the maximal 
function rather than the maximal function of the derivative. As usual, /' denotes the pointwise 
derivative of a function /, while / + := max{/, 0} and /~ := max{— /, 0} stand for its positive 
and negative part respectively. We shall use /' and Df indistinctly when / is absolutely 
continuous. Note that Mf may be constant even if / is bounded, nonnegative, and not 
constant. 

Theorem 5.1. Let I be an interval with infinite length, and let u : I — > R be an absolutely 
continuous function such that V(u') < oo. Then 

(5.1) Wu'Wl < 48|MU (\\DM{u' + )\\ 00 + IIDMOOIU) . 
IfI = R, then 

(5.2) Wu'Wl < 24HUHOO (\\DM(u' + )\\ 00 + IIDMOOIU) . 

Proof. Suppose / has infinite length, ||w||oo < oo, and ||w'||oo > 0. We claim that for every 
t > 0, 

(5.3) IKIU <max j^|H| 0C ,6t(||DM( M /+ )|| 0O + pM^IU) j . 

It follows, by letting t -> oo, that ||DM(u' + )|| QO + WDMiu'-)^ > 0. Setting 

4||d 



3(\\DM(u'+)\\ 00 + WDMiu'-)^ 
we obtain (|5.1|) . 

To prove the claim we distinguish two cases, depending on which term of the right hand 
side controls the left hand side. We may assume that I is closed (otherwise we just extend u 
to the closure of I using uniform continuity). Fix t > and a G (5/6, 1). Select xo G I such 
that max{M(u' + )(x ), M{u'~)(xq)} > aWu'W^. Without loss of generality we may assume 
that M(u' + )(x Q ) > alKHoo. 

Case 1. Suppose there exists a y G [xq — t, xo + t) PI / such that M{u' + )(y) < |||w'||oo- By 
Theorem 12.51 M{u' + ) is absolutely continuous, so 



\M(u l+ )(x ) - M(u'+)(y)\ 



DM(u ,+ ) 



< \\DM(u' + )\Ux -y\ 
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Hence we have 



(5.4) IpM^IU^ 



M(u ,+ )(x ) - M{u'+){y) 



x -y 



> (Q- 5/6) IKI 



Case 2. For all y G [x — £, x + t] fl / we have M{u' + ){y) > g 1 1 it' 1 1 do j so there exist a, 6 G 
with a < y < b such that 



A(J n (a, &)) i 7 n(a,6) 6 
Write := / fl (a, 6) (of course, a and 6 depend on ?/). Then 

\(l y n{u'+ = o})<±\(i y ). 

Now {i^ : !/ 6 [xq — t, x + t] R J} is (in the subspace topology of J) an open cover of the 
compact interval [x — t, x + 1] fl /, so the latter set has a finite subcover {/i, J 2 , . . . , Jjv}- By 
further refining the collection, if needed, we may assume that for every x G U^Zj, 

N 



1=1 



(if a point belongs to three intervals, at least one of them is contained in the union of the 
other two, so discard it). Then 



(5.5) 2|MU > / «-= / . «< + - ! u- > \y. I «' + - £ 



N N 



- E ( 7^ A ( J - A n K + = o» ) IKIU > ^IKIU E 
1=1 ^ ' i=i 

> ^IKHooACUfJi) > ||«'||oc^ 

To obtain (|5.Hjl . put together (|5.4jl and (|5.5j) : then let a — > 1. 

If J = R, the same argument yields ||u'||co|) instead of || it'll oof, as the rightmost term in 
(|5.5j) . It is easy to check that this in turn gives (|5.2|) . □ 

For completeness, we state the corresponding result when / is bounded. In this case ()5.3|) 
is replaced by 



llw'Hoo < max — — — — \\u\U6t (||DM(n ,+ )|| 00 + ||Z>M(u'-)lloo) 
for every t > 0. Hence we obtain the following 
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Theorem 5.2. Let I be a bounded interval, and let u : I — > R be an absolutely continuous 
function such that V{u') < oo. If 



A(/)> 



4||d 



we have 
while if 

we get the estimate 



SiWDMiu'+^ + WDMiu'-^Y 
KIlL < 48||«|| 0O (\\DM(u' + )\\ 00 + WDMiu'-)^) 



X(I)< 



i\\u\ 



oc 



SiWDMiu'+^+WDMiu'-)^ 



8 ,, 



A(J)' 

Example 5.3. Working with DMf rather than with MDf may lead to much better bounds, 
as it happens in the following example. Let / : R — > R be the characteristic function of [0, 1]. 
Then f = a.e. and Df = S - 5 U so \Df\(R) = 2. It is easy to check that Mf(x) = x~ l 
if x > 1, M/(x) = 1 if < a; < 1, and Mf(x) = (1 - a;)" 1 if x < 0, so Mf is not just 
Lipschitz, but even better: DMf e 5V(R). As a side remark, we mention that for this / we 
have |D/|(R) = ||DM(/)|| i i( K ), so the inequality in Theorem 12.51 is sharp for R, and in fact, 
for every other interval /: While equality is only achieved on the real line, the constant 1 can 
never be improved, as can be seen by considering the characteristic function of a subinterval 
J G I, and then letting J shrink to a point in the interior of /. 

Now, taking F(x) := f_ f(x)dx, a classical Landau inequality (p = oo) for F, F' and F" 
would fail, since ||.F"||oo — 0. Replacing F" by DF' does not help either, as H-D-F'Hqo makes 
no sense, and a natural definition using regularizations would lead to H-DF'Hoo = oo (in which 
case the inequality would be true but not useful). 

Trying to extend to the setting of functions of bounded variation, the pointwise, maximal 
function versions of Landau's inequality due to Agnieszka Kalamajska, and independently to 
Vladimir Maz'ya and Tatyana Shaposhnikova, would face a similar difficulty: If the distri- 
butional derivative has a singular part, i.e., if the function is of bounded variation but not 
absolutely continuous, then its maximal function will blow up somewhere. In the example 
we are considering, it is easy to see that M\DF'\(x) > max{|x| _1 , \x — 

Even ignoring regularity issues and replacing / with mollified versions of it, or piecewise 
linear continuous variants, the bounds obtained by considering DMF' instead of M\DF'\ are 
distinctly better: Let f n = 1 on [0,1], /„ = on (— oo, — n~ l ) U (1 + n _1 ,oo), and extend 
f n linearly in each of the two remaining intervals, so that f n is continuous. Now ||/n||oo = 1 
and Theorem 15.11 does indeed give a bound uniform in n. However, the bounds obtained 
via the classical Landau inequality deteriorate as n — >• oo, and the same happens on small 
neighborhoods of and 1 with pointwise inequalities using Mf' n . 
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While the function F above is bounded, it is not integrable. To obtain a similar example in 
L 1 fl L°°, let g(x) := f(x) — f(x — 1) and let G(y) := f_ g(x)dx. For a continuous example, 
with singular (and continuous, as a measure) distributional derivative, let / be the standard 
Cantor function on [0, 1], defined using the Cantor "middle third" set C. We extend it to R, 
first by setting f(x) = if x < and f(x) = 1 if 1 < x < 3/2. Then reflect about the axis 
a; = 3/2. Now let #(x) := f(x) -f(x-3). Then G(y) := f y oo g(x)dx belongs toZ^flL 00 , and, 
applying the notation of Lemma 13.81 to M(g + ), it is clear that R C -Ei, so UDMg+Hoo < 1, 
and the same happens with M(#_). Thus, ||£>M(p + ) ||oo + ||DM (#_) ||oo < 2 and once more 
we obtain a finite bound on the right hand side of inequality ([5.2)1 . 

Remark 5.4. It is natural to enquire whether for some constant c the simpler inequality 
(5.6) IKlU^cHulUlpM^OlU 

holds. Fix c > and select N G N such that 1 > c/iV. For = 0, . . . , N — 1, set «' = 1 on 
the intervals (k/N, (2k + l)/(2iV)], u' = -1 on the intervals ((2k + l)/(2N), (k + l)/N], and 
u' = off (0, 1]. Then Hu'll^ = 1, \\u\loo = 1/2N, and \u'\ = X(o,i], so ||D¥(u')||oo = 1. Thus, 
JSU) fails. 

Remark 5.5. As indicated in the introduction, inequality (j5.1|) implies Landau's, though 
not with the sharp constant (in fact, the constant is not even close; the point of course is 
that ()5.1|) can yield nontrivial bounds for some non-Lipschitz, even discontinuous u'). 

Kinnunen showed that for / e W 1 '°°(M d ), \\DMfWn < |p/|U (cf. |Kf, pages 120 and 
121). The same holds for Lipschitz functions on/cl, as we note in the next theorem. While 
the argument is basically the same, formally this theorem does not follow from Kinnunen's 
result since we are considering also the local case / ^ R. So we include the proof. Now 
we have that if vl is absolutely continuous on an unbounded interval / and H-Dw'Hoo < oo, 
then || j DM(m'+)|| 0O + WDM^Woo < \\Du' + \\ OQ + WDu'-^ < 2|pu'|| 0O , so by flHD, IKIU < 
96||w|| 00 ||ZDw. / j| 00 . 

Theorem 5.6. If f : / — > R is Lipschitz, then so is Mf, and Lip(M/) < Lip(/), or 
equivalently WDMfW^ < \\Df 11^. The same holds for Mr/ '. 

Proof Select x,y <E I, and let / > 0. Suppose Mf(x) > Mf(y) and x < y (in the case 
x > y the argument is entirely analogous). If Mf(x) = f(x), then \Mf(x) — Mf(y)\ < 
I/O) - f(y)\ < Lip(/)|x - y\. Otherwise, Mf(x) > f(x), so 

Mf(x)= sup 7— — f f- 

{{a,b]Cl:a<b,a<x<b<y} ~ a Ja 



Now 



Mf(x) - Mf(y) < SU P{[a,b]Cl:a<b,a<x<b<y} la f ~ la+y-b f) 

y — x ~ y — x 

sup \- / (f(t) -f(t + y- b)) dt 

{[a,b]Cl:a<b,a<x<b<y} {V ~ X)\0 — a) J a 
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1 f b 

< sup — -/ (y -b)Up(f)dt < Lip(/). 

{[a,b]Cl:a<b,a<x<b<y} {V ~ X){0 — a) J a 



□ 



In fact, the constant 1 given in the preceding theorem is not sharp. After this paper was 
completed, in joint work with Leonardo Colzani the authors have found the best constants: 
Lip(M/) < 2" 1 Lip(/) for arbitrary intervals, while on R, Lip(M/) < {^2 - l)Lip(/). So 
when deriving the classical Landau inequality from the generalization presented here, the 
constant 96 at the end of Remark 15. 51 can be lowered to 48. 

To finish, we present, again under less regularity, a trivial variant of the classical Poincare- 
Wirtinger inequality, which states that if / : [a, b] — > R is an absolutely continuous function 
with f(a) = f \b) = 0, then 

b rb 

f(x) 2 dx < c / f'(x) 2 dx, 

J a 

where c depends only on b — a. Using the local maximal operator M R , we prove a variant 
of the above inequality, for functions of bounded variation with support at positive distance 
from the boundary. 

Theorem 5.7. Let f : [a, b] — > M be such that V(f) < oo and supp / C [a + R, b — R] for 
some R > 0. Then 

r b t-b 

f(x) 2 dx <c DM R f(x) 2 dx. 

J a 

Proof. By Theorem 12.51 M R f is absolutely continuous since V(f) < oo, and by hypothesis, 
M R f(a) = Mftf(b) = 0, so we can apply the classical Poincare-Wirtinger inequality to Mr/, 
obtaining 

rb rb rb 

f{x) 2 dx < I M R f{x) 2 dx <c (DM R f(x)) 2 dx, 

J a J a 

□ 
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